Ranked-Listed or Categorized Results in IR
نویسندگان
چکیده
In this paper we examine the performance of both rankedlisted and categorized results in the context of known-item search (target testing). Performance of known-item search is easy to quantify based on the number of examined documents and class descriptions. Results are reported on a subset of the Open Directory classification hierarchy, which enable us to control the error rate and investigate how performance degrades with error. Three types of simulated user model are identified together with the two operating scenarios of correct and incorrect classification. Extensive empirical testing reveals that in the ideal scenario, i.e. perfect classification by both human and machine, a category-based system significantly outperforms a ranked list for all but the best queries, i.e. queries for which the target document was initially retrieved in the top-5. When either human or machine error occurs, and the user performs a search strategy that is exclusively category based, then performance is much worse than for a ranked list. However, most interestingly, if the user follows a hybrid strategy of first looking in the expected category and then reverting to a ranked list if the target is absent, then performance can remain significantly better than for a ranked list, even with misclassification rates as high as 30%. We also observe that this hybrid strategy results in performance degradations that degrade gracefully with error rate.
منابع مشابه
The Ranking of Fraudulent Financial Reporting By Using Data Envelopment Analysis: Case of Pharmaceutical Listed Companies
Fraudulent financial reporting has been one of the most sensitive issues on the business world. Financial statements that conceal the company's facts have caused great losses to its stakeholders. The ranking of companies based on fraudulent financial reporting is one of the key issues for performance analysis. This study, by using financial variables and the data envelopment analysis methodolog...
متن کاملFinding Experts By Semantic Matching of User Profiles
© Finding Experts By Semantic Matching of User Profiles Rajesh Thiagarajan, Geetha Manjunath, Markus Stumptner HP Laboratories HPL-2008-172 semantics, similarity matching, ontologies, user profile Extracting interest profiles of users based on their personal documents is one of the key topics of IR research. However, when these extracted profiles are used in expert finding applications, only na...
متن کاملHOMA-IR is associated with significant angiographic coronary artery disease in non-diabetic, non-obese individuals: a cross-sectional study
UNLABELLED Insulin resistance is a major component of metabolic syndrome, type 2 Diabetes Mellitus (T2DM) and coronary artery disease (CAD). Although important in T2DM, its role as a predictor of CAD in non-diabetic patients is less studied. In the present study, we aimed to evaluate the association of HOMA-IR with significant CAD, determined by coronary angiography in non-obese, non-T2DM patie...
متن کاملRanking Stock Exchange Companies With a Combined Approach Based on FAHP-FTOPSIS Financial Ratios and Comparing Them With Tehran Stock Exchange Rankings
Ranking of companies listed on the exchange represent their status and considered a criterion for investment. Also, it increases market's competition, development and efficiency. In this study, the fifty superior companies listed in Tehran Stock Exchange were ranked based on financial ratios (liquidity, operational, leverage and profitability) using FAHP- FTOPSIS hybrid approach during the year...
متن کاملForecasting Credit Risk in Banks Listed on Tehran Stock Exchange
The present study aim is to offer a systematic method of assessing the credit risk of banks and also to identify key indicators using Decision Making Trial and Evaluation Laboratory (DEMATEL) technique as well as using Logit Regression in order to predict the credit risk of listed banks. The population of the study consists of the legal clients of the bank (Ansar Bank, Bank Saderat Iran, Bank M...
متن کامل